IN THE CLAIMS : 

Please amend the claims as shown below, 
subject application, read as follows: 



The claims, as pending in the 



1. (Currently Amended) A data processing apparatus capable of 
communicating with a server and a browser apparatus via a network, comprising: 

means for obtaining data from the server according to a request from the 
browser apparatus; 

means for creating synthetic voice data indicating a part or the whole of the 
contents of the obtained data; 

means for storing the synthetic voice data; 

means for forming data by adding to the obtained data an identifier which is 
comprised of text data and which indicates indicating a location where the synthetic voice 
data is stored; and 

means for providing said browser apparatus with the formed data. 

2. (Withdrawn) A data processing apparatus capable of communicating 
with a server and a browser apparatus via a network, comprising: 

means for obtaining data from the server according to a request from the 
browser apparatus; 

means for checking whether the contents of the obtained data include a 
content requiring a response from said browser apparatus; 
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means for forming data by adding to the obtained an identifier indicating a 
recipient for performing voice recognition for the response sent by voice data from said 
browser apparatus based on a result of the checking performed by the checking means; and 

means for providing said browser apparatus with the formed data. 

3. (Withdrawn) The apparatus according to claim 2, further comprising 
recognizing means for performing voice recognition for voice data related to the response, 
when the voice data is supplied from said browser apparatus to said recipient. 

4. (Withdrawn) The apparatus according to claim 3, further comprising: 
means for forming response data in a form suited to a server for receiving 

the response on said network, on the basis of the result of recognition by said recognizing 
means; 

and means for providing the response data to said server. 

5. (Withdrawn) The apparatus according to claim 2, further comprising: 
means for forming a recognition grammar for recognizing voice data related 

to each of a plurality of predetermined items, when the response is to be selected from said 
plurality of items; 

means for determining, on the basis of the recognition grammar, to which 
item the voice data related to the response from said browser apparatus corresponds; 

means for forming response data in a form suited to a server for receiving 
the response on said network, in accordance with each item; and 

means for providing the response data to said server. 
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6. (Withdrawn) The apparatus according to ciaim 5, wherein the response 
data is formed before the formed data is provided to said browser apparatus. 

7. (Currently Amended) A browser system comprising a browser 
apparatus, a server, and a data processing apparatus capable of communicating with said 
browser apparatus and said server via a network, wherein said data processing apparatus 
comprises: 

means for obtaining data from the server according to a request from the 
browser apparatus; 

means for creating synthetic voice data indicating a part or the whole of the 
contents of the obtained data; 

means for storing the synthetic voice data; 

means for forming data by adding to the obtained data an identifier which is 

comprised of text data and which indicates indicating a location where the synthetic voice 

data is stored; and 

means for providing said browser apparatus with the formed data, 

wherein said browser apparatus comprises means for acquiring the synthetic 

voice data from the location indicated by the identifier and outputting a voice related to the 

synthetic voice data. 

8. (Withdrawn) A browser system comprising a browser apparatus, a 
server, and a data processing apparatus capable of communicating with said browser 
apparatus and said server via a network, wherein said data processing apparatus comprises: 
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means for obtaining data from the server according to a request from the 
browser apparatus; 

means for checking whether the contents of the obtained data include a 
content requiring a response from said browser apparatus; 

means for forming data by adding to the obtained data an identifier 
indicating a recipient for performing voice recognition for the response sent by voice data 
from said browser apparatus based on a result of the checking performed by the checking 
means; 

means for providing said browser apparatus with the formed data: 

recognizing means for performing voice recognition for voice data related to 
the response, when the voice data is supplied from said browser apparatus to said recipient; 

means for forming response data in a form suited to said server for receiving 
the response, on the basis of the result of recognition by said recognizing means; and 

means for providing the response data to said server, and said browser 
apparatus comprises; 

means for inputting a voice; 

means for forming voice data on the basis of the input voice; and 
means for supplying the formed voice data to the recipient indicated 
by the identifier in the formed data from said data processing apparatus. 

9. (Withdrawn) A browser system comprising a browser apparatus, a 
server, and a data processing apparatus capable of communicating with said browser 
apparatus and said server via a network, wherein said data processing apparatus comprises: 
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means for obtaining data from said server according to a request from said 
browser apparatus; 

means for creating synthetic voice data indicating a part or the whole of the 
contents of the obtained data; 

means for storing the synthetic voice data; 

means for forming first data by adding to the obtained data a first identifier 
indicating a location where the synthetic voice data is stored; 

means for providing said browser apparatus with the first data; 

means for checking whether the contents of the obtained data include a 
content requiring a response from said browser apparatus; 

means for forming second data by adding to the obtained data a second 
identifier indicating a recipient for performing voice recognition for the response sent by 
voice data from said browser apparatus based on a result of the checking performed by the 
checking means; 

means for providing said browser apparatus with the second data; 

recognizing means for performing voice recognition for voice data related to 
the response, when the voice data is supplied from said browser apparatus to said recipient 
means for forming response data in a form suited to said server for receiving the response, 
on the basis of the result of recognition by said recognizing means; and 

means for providing the response data to said server, 

wherein said browser apparatus comprises: 

means for acquiring the synthetic voice data from the location 
indicated by the first identifier in the first data from said data processing apparatus 
and outputting a voice related to the voice data; 
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means for inputting a voice; 

means for forming voice data on the basis of the input voice; and 
means for supplying the formed voice data to the recipient 

indicated by the second identifier in the second data from said data processing 

apparatus. 

10. (Currently Amended) A data processing method of a data processing 
apparatus capable of communicating with a server and a browser apparatus via a network, 
said method comprising the steps of: 

obtaining data from the server according to a request from the browser 

apparatus; 

creating synthetic voice data indicating a part or the whole of the contents of 
the obtained data; 

storing the synthetic voice data; 

forming data by adding to the obtained data an identifier which is comprised 
of text data and which indicates indicating a location where the synthetic voice data is 
stored; and 

providing the browser apparatus with the formed data. 

1 1 . (Withdrawn) A data processing method of a data processing apparatus 
capable of communicating with a server and a browser apparatus via a network, said 
method comprising the steps of: 

checking whether the contents of the obtained data include a content 
requiring a response from the browser apparatus; 
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forming data by adding to the obtained data an identifier indicating a 
recipient for performing voice recognition for the response sent by voice data from the 
browser apparatus based on a result of the checking performed in the checking step; and 

providing the browser apparatus with the formed data. 



12. (Withdrawn) The method according to claim 11, further comprising the 
recognition step of performing voice recognition for voice data related to the response, 
when the voice data is supplied from the browser apparatus to the recipient. 



13. (Withdrawn) The method according to claim 12, further comprising the 

steps of: 

forming response data in a form suited to a server for receiving the response 
on the network, on the basis of the result of recognition in the recognition steps and 
providing the response dada to the server. 



14. (Withdrawn) The method according to claim 11, further comprising the 

steps of: 

forming a recognition grammar for recognizing voice data related to each of 
a plurality of predetermined items, when the response is to be selected from the plurality of 
items; 

determining, on the basis of the recognition grammar, to which item the 
voice data related to the response from the browser apparatus corresponds; 

forming response data in a form suited to a server for receiving the response 
on the network, in accordance with each items; and 
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providing the response data to the server. 

15. (Withdrawn) The method according to claim 14, wherein the response 
data is formed before the formed data is provided to the browser apparatus. 

16. (Currently Amended) A recording medium recording a program which 
allows a computer capable of communicating with a server and a browsesr apparatus via a 
network to function as: 

means for obtaining data from the server according to a request from the 
browser apparatus; 

means for creating synthetic voice data indicating a part or the whole of the 
contents of the obtained data; 

means for storing the synthetic voice data; 

means for forming data by adding to the obtained data an identifier which is 
comprised of text data and which indicates indicating a location where the synthetic voice 
data is stored; and 

means for providing the browser apparatus with the formed data. 

17. (Withdrawn) A recording medium recording a program which allows a 
computer capable of communicating with a server and a browser apparatus via network to 
function as: 

means for obtaining data from the server according to a request from the 
browser apparatus; 



means for checking whether the contents of the obtained data have contents 
requiring a response from said browser apparatus; 

means for forming data by adding to the obtained data an identifier 
indicating a recipient for performing voice recognition for the response sent by voice data 
from said browser apparatus based on a result of the checking performed by the checking 
means; and 

means for providing said browser apparatus with the formed data. 

1 8. (Withdrawn) The medium according to claim 17, wherein said program 
comprises a program which allows a computer to function as recognizing means for 
performing voice recognition for voice data related to the response, when the voice data is 
supplied from said browser apparatus to said recipient. 

19. (Withdrawn) The medium according to claim 18, wherein said program 
comprises a program which allows a computer to function as: 

means for forming response data in a form suited to a server for receiving 
the response on said network, on the basis of the result of recognition by said recognizing 
means; and 

means for providing the response data to said server. 

20. (Withdrawn) The medium according to claim 17, wherein said program 
comprises a program which allows a computer to function as: 



- 10- 



means for forming a recognition grammar for recognizing voice data related 
to each of a plurality of predetermined items, when the response is to be selected from said 
plurality of items; 

means for determining, on the basis of the recognition grammar, to which 
item the voice data related to the response from said browser apparatus corresponds; 

means for forming response data in a form suited to a server for receiving 
the response on said network, in accordance with each item; and 

means for providing the response data to said server. 

21 . (Withdrawn) The medium according to claim 20, wherein the response 
data is formed before the formed data is provided to said browser apparatus. 

22. (Previously Presented) The apparatus according to claim 1, wherein the 
obtained data is described in a markup language, and the identifier is added to the obtained 
data as a tag corresponding to the markup language. 

23. (Withdrawn) The apparatus according to claim 2, wherein the obtained 
data is described in a markup language, and the identifier is added to the obtained data as a 
tag corresponding to the markup language. 

24. (Previously Presented) The system according to claim 7, wherein the 
obtained data is described in a markup language, and the identifier is added to the obtained 
data as a tag corresponding to the markup language. 
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25. (Withdrawn) The system according to claim 8, wherein the obtained 
data is described in a markup language, and the identifier is added to the obtained data as a 
tag corresponding to the markup language. 

26. (Withdrawn) The system according to claim 9, wherein the data 
provided by said server is described in a markup language, and the identifier is added to the 
data as a tag corresponding to the markup language. 

27. (Previously Presented) The method according to claim 10, wherein the 
obtained data is described in a markup language, and the identifier is added to the obtained 
data as a tag corresponding to the markup language. 

28. (Withdrawn) The method according to claim 11, wherein the obtained 
data is described in a markup language, and the identifier is added to the obtained data as a 
tag corresponding to the markup language. 

29. (Previously Presented) The medium according to claim 16, wherein the 
obtained data provided on said network is described in a markup language, and the 
identifier is added to the obtained data as a tag corresponding to the markup language. 

30. (Withdrawn) The medium according to claim 17, wherein the obtained 
data is described in a markup language, and the identifier is added to the obtained data as a 
tag corresponding to the markup language. 
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31. and 32. (Canceled) 

33. (Withdrawn) A data processing apparatus capable of communicating 
with a server and a browser apparatus via a network, comprising: 

means for obtaining data from the server according to a request from the 
browser apparatus; 

means for creating synthetic voice data indicating a part or the whole of the 
contents of the obtained data; 

means for storing the formed voice data; 

means for adding to the obtained data a first identifier indicating a location 
where the synthetic voice data is stored; 

means for checking whether the contents of the obtained data include a 
content requiring a response from said browser apparatus; 

means for further adding, based on a result of the checking performed by the 
checking means, a second identifier indicating a recipient for performing voice recognition 
for the response sent by voice data from the browser apparatus to the obtained data to 
which the first identifier is added; 

means for providing said browser apparatus with the obtained data to which 
the first identifier or the first and second identifiers are added; 

recognizing means for performing voice recognition for voice data related to 
the response, when the voice data is supplied from said browser apparatus to said recipient; 

means for forming response data in a form suited to said server for receiving 
the response, on the basis of the recognition result by said recognizing means; and 

means for providing the response data to said server. 
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